REMARKS 

Obviousness Rejection 

Claims 1-33 are rejected as obvious over Ju et al. (7,146,320) in view of Atkin, et 
al., US 2007/0276667. The rejection should be withdrawn in view of the following 
remarks. 

Claim 1 

In claim 1, tags are inserted into the email which separate content from one source 
or sender ("first source") from the content of another source or sender ("second source"). 
The tags are detected (e.g., in a voice command platform that features a text to speech 
engine for rendering email as speech) and the content from the first source is rendered in 
a first voice mode (e.g., with a man's voice) and content from the second source is 
rendered in a second voice mode (e.g., with a woman's voice). The feature of the tags 
and rendering content from different sources or senders in different voice modes helps 
the email recipient understand the breaks in the email between senders and distinguish for 
example an original email from a reply. 1 

Ju describes a feature wherein a user of a telephone may respond to an email by 
means of speech. In the Ju system, the e-mail from the sender is rendered as speech, but 
there is no email content from a second source/sender (i.e., an email from another party) 
which is also rendered in speech, as required by claim 1. In the passage at columns 5 
and 6 of Ju, the tags <questions> and <answers> do not delimit email content from 

1 The term "content . . . provided by a first source" and "content . . . provided by a second source" is 
understood from the applicants' specification as referring to email content, such as text, which originates 
from a person or entity which generates and sends an email. It can refer to an original email, or a reply to 
an email, or a reply to a reply, etc. 
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different senders. The recipient of the email only hears content from one source— the 
question that is presented ("are you able to come home for dinner?"). They have not 
even answered the question yet. When they do provide the answer, via speech, the 
speech is converted to text and the original sender receives a text email message 
containing their original question, and the answer. See col. 7 lines 7-9. 

Furthermore, even if the proposed answer structure (yes, no, maybe) could 
arguably constitute "second content in said email provided by a second source", and 
applicants submit that it does not fall within the scope of that term, the system of Ju does 
not provide a system for supporting two different voice modes, as recited in claim 1, or 
rendering the question in one voice mode and the proposed answer structure in a second 
voice mode. Atkin does not suggest that Ju should be modified in this regard. 

In particular, Atkin is silent on the concept of rendering emails as voice. Atkin' s 
disclosure is directed to the different fields of rendering of Internet content as voice, and 
rendering content from a text file from a storage device such as a diskette. See Aktin, 
paragraphs 5, 9. Aktin does not suggest that its teachings are applicable to the subject 
matter of rendering email content. 

Furthermore, a person skilled in the art would not look to Aktin for teachings 
applicable to the system of Ju since Ju does not recognize the problem that is solved by 
this invention - the rendering of email content from different sources in a manner that 
highlights the content from different senders. Neither Ju nor Aktin recognize that when 
rendering email content as speech, including content which includes content from 
different senders, there is a need to separate such content with tags and use such tags to 
render content from different senders in different voice modes. Ju is completely silent 
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on the concept of rendering content in different voice modes. Ju is directed to a method 
for enabling a person to reply to a voice mail with speech, and is fundamentally directed 
to the problem of enabling replies to email as speech, and is not concerned or even aware 
of the issues that can arise just be rendering email content as speech, let alone generating 
a voice response. 

Accordingly, it cannot be maintained that modification of Ju in view of Atkins 
was obvious to a person skilled in the art. 

Claim 2 

Claim 2 states that the second content comprises a reply email message to the 
original email message, this second content is rendered as speech in a different voice 
mode per claim 1. In rejecting this claim the Examiner cites to col. 6 lines 12-26 of Ju. 
This passage does not support the rejection. In Ju, the reference is stating that the server 
processes the original email message and the proposed answer email message, but this 
proposed answer message (options of yes, no, maybe), is not a reply to the original email. 
It is a set of proposed responses which are provided to the recipient. The system does not 
render a "reply" to the original email message (are you able to come home for dinner) 
since the user has not even entered their response, and then again the response that they 
speak is not rendered to them, they are generating it for transmission to the original 
sender. The original sender receives their original email content and the reply as text, 
not speech as stated in col. 7 lines 6-8. 
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Claim 5 

Claim 5 is similar to claim 1 and describes a method which requires identifying an 
original email message and a reply message, rendering the original email in one voice 
mode and the reply message in a different voice mode. The remarks for claims 1 and 2 
are applicable to the rejection of claim 5 and incorporated by reference here. 

Claim 6 

As to claim 6, this claim requires processing the email to identify content from a 
third source, and rendering the content in a third voice mode different from the first and 
second voice modes. The Examiner's analysis of claim 6 is limited to the subject matter 
of the original content and the reply, and does not address the additional subject matter of 
claim 6. Furthermore, the passage at col. 6 lines 12-26 makes no mention of either (a) 
identifying in an email message content from a third source or sender, or (b) rendering 
such content in a third voice mode. The Examiner has not established prima facie that 
claim 6 is obvious. 

Claim 8 

Claim 8 depends from claim 5 and further adds that the method also include steps 
of "(1) processing said email message to detect at least one of the following: a) a 
signature block to either said original email message or said reply message, b) a privacy 
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notice, or c) a confidentiality notice; and (2) not rendering at least one of said detected 
signature block, confidentiality notice or privacy notice as speech." 

The rejection of claim 8 is not well founded. The Examiner lumps claim 8 into the 
group of claims including claims 4, 22, 26 and 31. The Examiner cites to Ju, col. 6 lines 
27-54). These passages are directed to the language model and where it may reside or be 
retrieved from, and have nothing to do with what is claimed in claim 8. The Examiner 
does not attempt to find any teaching in either Ju or Aktin that an email server processes 
email to detect a signature block, confidentiality notice or privacy notice and then is able 
to not render such detected block or notice. These references do not disclose this 
invention of claim 8. 

The Examiner states in rejection claims 11 and 12 that it is well known that a 
signature block and privacy and confidentiality notices in email are not converted to 
speech. The applicants dispute this. The Examiner is required to come up with a citation 
to a reference that teaches this subject matter. The Examiner can only rely on official 
notice of a fact being "well known prior art" when the facts asserted as being known are 
"capable of instant and unquestionable demonstration as to defy dispute." MPEP 
2144.03, section A (citing case law). Furthermore, if official notice is taken, the 
technical line of reasoning underlying a decision must be clear and unmistakable. Id., 
section B. So such line or reasoning has been set forth. If the applicant challenges it, as 
here, the Examiner must support the finding with adequate evidence. Id., Section C. 
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Claim 9 

Claim 9 depends from claim 5 and states "wherein said reply message comprises 
a voice memo in response to a specific portion of said original email message, and 
wherein the method further comprises the step of processing said email message to detect 
a tag associated with said voice memo, and responsively rendering said voice memo as 
speech." In rejection of claim 9, the Examiner has cited to Ju at col. 6 lines 27-54. This 
passage says nothing about detection of a tag associated with a voice memo in a reply 
email or rendering this voice memo as speech. The passage cited by the Examiner is 
directed to providing a language model to a speech recognition module (step 164 of 
Figure 3). The reference states in this passage several possible options for providing the 
language model, and notes that it could originate at the sender's computer, at the email 
server or at a remote server. This passage says nothing about detection, within a reply 
message, of a voice memo. The rejection is not sound. 

Claim 10 depends from claim 9 recites that the tap detected in claim 9 is a pointer 
to an object comprising the voice memo. This subject matter is also not found at all in Ju 
at col. 6 lines 27-54. 

Claims 11. 12. 13, Hand 15 

The remarks for claim 8 apply to claims 11-14. The Examiner is required to 
furnish a citation of a teaching in Ju or Atkin of the claimed subject matter. 

Claim 16 
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Claim 16 is in Jepson form and describes an improvement to an email server in 
the form of a machine readable storage medium storing instructions for parsing an email 
to detect email content and signature blocks, and inserting a tag in the email message that 
separates email content from the signature block. Nothing of the sort is described in 
either Ju or Atkin. The Examiner resorts to handwaving, saying that it is known to not 
convert a signature block to speech. The Examiner is required to cite to an email server 
that includes instructions that both parses an email to detect email content and a signature 
block, and instructions to insert a tag into the email that separates the email content from 
the signature block. Applicants dispute that such a server is unquestionably known. See 
the response to claim 8, supra. 

Claim 17 

Claim 17 is in Jepson form and is directed to an improved email server which 
includes instructions for parsing an email to detect content from a first source (e.g., 
original email author) and second source (recipient of the original email), and instructions 
inserting a tag in the email message that separates the email content from the first source 
from the email content of the second source. As explained in the specification an email 
with these tags can be used by the voice command platform to render the content from 
the first source in one voice mode (accent, pitch, gender, etc.) and the content from the 
second source in a different voice mode. The remarks for claim 1 are incorporated by 
reference here. Ju is silent on parsing email as claimed or inserting tags to delineate 
email content from different sources. 
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Claim 18 

Claim 1 8 is similar to claim 17 but instead of inserting tags to distinguish between 
content from different sources the email server includes instructions to insert tags that 
separate content from confidentiality and/or privacy notices. The remarks for claims 12 
and 13 are incorporated by reference here. Ju does not parse email and inserts tags as 
recited in claim 18 to distinguish between email content and privacy and/or 
confidentiality notices. 

Claim 19 

Independent Claim 19 is directed to a method of allowing a recipient of an email 
message to respond via voice, and involves receiving a signal from a recipient indicating 
that the recipient intends to respond to the email message by inserting a voice memo at a 
particular location in the email message. The signal is received while the email is being 
rendered as speech. The Examiner has not cited to anything in Ju that discloses this 
feature. See the response to the rejection of claim 1 , which is reincorporated here. 

In Ju, the recipient of the original email indicates their response to the email via 
speech, and their response is not rendered to the original source as speech (as required by 
claim 19). Rather, the original source receives a text email. Ju, col. 7 lines 6-8. 

Claim 20 

Claim 20 depends from claim 19 and states that the provides a signal in the form 
of a DTMF tone (touch tone) to indicate that the recipient intends to respond to the email 
message by inserting a voice memo at a particular location in the email message. The 
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Examiner has not cited to a teaching applicable to claim 20. The Examiner lumps claim 
20 into other claims and cites to col. 6 lines 12-26 of Ju. In this passage, the reference is 
describing steps that occur to transmit the original email to the recipient, not steps that the 
recipient takes to indicate they want to insert a voice memo. The passage is also silent on 
voice memos, insertion of voice memos at a particular point in an email, and also the use 
of a DTMF tone to indicate the intention to insert a voice memo. The rejection of claim 
20 is not well founded. 

The rejection of dependent claims 21-33 should be withdrawn in view of the 
above remarks to the applicable independent claims or related dependent claims. 



Favorable reconsideration of the application is requested. 



Respectfully submitted. 
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